Your wildcard search against 10000 terms has yielded the results below. 

Your result set for the last X# is incomplete. 
The probable cause is use of unlimited truncation. Revise your search strategy to use limited truncation. 





Search Results - Record(s) 1 through 16 of 16 returned. 



□ 1. Document ID: US 20040120572 A1
Using default format because multiple data bases are involved.

L4: Entry 1 of 16 File: PGPB Jun 24, 2004

PGPUB-DOCUMENT-NUMBER: 20040120572
PGPUB-FILING-TYPE: new

DOCUMENT-IDENTIFIER: US 20040120572 A1

TITLE: Method for using effective spatio-temporal image recomposition to improve
scene classification

PUBLICATION-DATE: June 24, 2004

INVENTOR-INFORMATION:

NAME                              CITY            STATE  COUNTRY

Luo, Jiebo                        Pittsford       NY     US

Gray, Robert T.                   Rochester       NY     US

Boutell, Matthew R.               Rochester       NY     US

US-CL-CURRENT: 382/159; 332/224



□ 2. Document ID: US 6915025 B2
L4: Entry 2 of 16 File: USPT Jul 5, 2005

US-PAT-NO: 6915025

DOCUMENT-IDENTIFIER: US 6915025 B2

TITLE: Automatic image orientation detection based on classification of low-level
image features

DATE-ISSUED: July 5, 2005

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Wang; Yongmei                     Hong Kong                          HK

Zhang; Hong-Jiang                 Beijing                            CN

US-CL-CURRENT: 382/289; 382/165, 382/190



□ 3. Document ID: US 6847680 B2
L4: Entry 3 of 16 File: USPT Jan 25, 2005

US-PAT-NO: 6847680

DOCUMENT-IDENTIFIER: US 6847680 B2

TITLE: Method for detecting talking heads in a compressed video

DATE-ISSUED: January 25, 2005

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Divakaran; Ajay                   Denville        NJ

Radhakrishnan; Regunathan         Ozone Park      NY

US-CL-CURRENT: 375/240.01; 375/240.08



□ 4. Document ID: US 6816847 B1
L4: Entry 4 of 16 File: USPT Nov 9, 2004

US-PAT-NO: 6816847

DOCUMENT-IDENTIFIER: US 6816847 B1

TITLE: Computerized aesthetic judgment of images

DATE-ISSUED: November 9, 2004

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Toyama; Kentaro                   Redmond         WA

US-CL-CURRENT: 706/14; 382/156, 382/157, 382/158, 382/159, 382/224, 706/18, 706/20



□ 5. Document ID: US 6760714 B1
L4: Entry 5 of 16 File: USPT Jul 6, 2004

US-PAT-NO: 6760714

DOCUMENT-IDENTIFIER: US 6760714 B1

** See image for Certificate of Correction **

TITLE: Representation and retrieval of images using content vectors derived from
image information elements

DATE-ISSUED: July 6, 2004

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Caid; William R.                  San Diego       CA

Hecht-Neilsen; Robert             Del Mar         CA

US-CL-CURRENT: 706/14; 382/116, 382/228



□ 6. Document ID: US 6751354 B2
L4: Entry 6 of 16 File: USPT Jun 15, 2004

US-PAT-NO: 6751354

DOCUMENT-IDENTIFIER: US 6751354 B2

TITLE: Methods and apparatuses for video segmentation, classification, and
retrieval using image class statistical models

DATE-ISSUED: June 15, 2004

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Foote; Jonathan T.                Menlo Park      CA

Wilcox; Lynn                      Portola Valley  CA

Girgensohn; Andreas               Menlo Park      CA

US-CL-CURRENT: 382/224; 348/395.1, 348/405.1, 348/408.1, 382/190, 382/209, 382/218,
382/220



□ 7. Document ID: US 6640145 B2
L4: Entry 7 of 16 File: USPT Oct 28, 2003

US-PAT-NO: 6640145

DOCUMENT-IDENTIFIER: US 6640145 B2

TITLE: Media recording device with packet data interface

DATE-ISSUED: October 28, 2003

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Hoffberg; Steven                  West Harrison   NY     10604

Hoffberg-Borghesani; Linda        Acton           MA     01720

US-CL-CURRENT: 700/83; 700/17, 700/19, 700/23, 704/200, 704/201, 704/7, 709/200,
709/201, 709/202



□ 8. Document ID: US 6487545 B1
L4: Entry 8 of 16 File: USPT Nov 26, 2002

US-PAT-NO: 6487545

DOCUMENT-IDENTIFIER: US 6487545 B1

TITLE: Methods and apparatus for classifying terminology utilizing a knowledge
catalog

DATE-ISSUED: November 26, 2002

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Wical; Kelly                      Redwood Shores  CA

US-CL-CURRENT: 706/45



□ 9. Document ID: US 6418424 B1
L4: Entry 9 of 16 File: USPT Jul 9, 2002

US-PAT-NO: 6418424

DOCUMENT-IDENTIFIER: US 6418424 B1

TITLE: Ergonomic man-machine interface incorporating adaptive pattern recognition
based control system

DATE-ISSUED: July 9, 2002

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Hoffberg; Steven M.               West Harrison   NY     10604

Hoffberg-Borghesani; Linda I.     Acton           MA     01720

US-CL-CURRENT: 706/21; 434/178, 706/52



□ 10. Document ID: US 6404925 B1
L4: Entry 10 of 16 File: USPT Jun 11, 2002

US-PAT-NO: 6404925

DOCUMENT-IDENTIFIER: US 6404925 B1

TITLE: Methods and apparatuses for segmenting an audio-visual recording using image
similarity searching and audio speaker recognition

DATE-ISSUED: June 11, 2002

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Foote; Jonathan T.                Menlo Park      CA

Wilcox; Lynn                      Portola Valley  CA

US-CL-CURRENT: 382/224; 348/480, 348/484, 358/403, 382/159, 382/173, 382/190,
382/197, 382/209, 382/225, 382/227, 382/276, 382/305, 704/239, 704/243, 704/245,
707/1, 707/3



□ 11. Document ID: US 6400996 B1
L4: Entry 11 of 16 File: USPT Jun 4, 2002

US-PAT-NO: 6400996

DOCUMENT-IDENTIFIER: US 6400996 B1

TITLE: Adaptive pattern recognition based control system and method

DATE-ISSUED: June 4, 2002

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Hoffberg; Steven M.               West Harrison   NY     10994

Hoffberg-Borghesani; Linda I.     Acton           MA     01720

US-CL-CURRENT: 700/83; 370/218, 370/355, 700/17, 700/24, 700/25, 700/86, 700/87,
709/223, 709/227, 715/810, 715/840, 715/841, 718/102, 719/318



□ 12. Document ID: US 6397200 B1
L4: Entry 12 of 16 File: USPT May 28, 2002

US-PAT-NO: 6397200

DOCUMENT-IDENTIFIER: US 6397200 B1

TITLE: Data reduction system for improving classifier performance

DATE-ISSUED: May 28, 2002

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Lynch, Jr.; Robert S.             Groton          CT

Willett; Peter K.                 Coventry        CT

US-CL-CURRENT: 706/20; 702/181



□ 13. Document ID: US 6173275 B1
L4: Entry 13 of 16 File: USPT Jan 9, 2001

US-PAT-NO: 6173275

DOCUMENT-IDENTIFIER: US 6173275 B1

** See image for Certificate of Correction **

TITLE: Representation and retrieval of images using context vectors derived from
image information elements

DATE-ISSUED: January 9, 2001

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Caid; William R.                  San Diego       CA

Hecht-Neilsen; Robert             Del Mar         CA

US-CL-CURRENT: 706/14; 382/190, 382/195, 382/224, 382/225, 706/12, 706/934



□ 14. Document ID: US 6072542 A
L4: Entry 14 of 16 File: USPT Jun 6, 2000

US-PAT-NO: 6072542

DOCUMENT-IDENTIFIER: US 6072542 A

TITLE: Automatic video segmentation using hidden markov model

DATE-ISSUED: June 6, 2000

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Wilcox; Lynn D.                   Portola Valley  CA

Boreczky; John S.                 San Leandro     CA

US-CL-CURRENT: 348/722; 348/700, 348/701, 382/155, 382/156, 382/157, 382/160,
386/68, 386/69, 386/95, 386/96



□ 15. Document ID: US 5987459 A
L4: Entry 15 of 16 File: USPT Nov 16, 1999

US-PAT-NO: 5987459

DOCUMENT-IDENTIFIER: US 5987459 A

** See image for Certificate of Correction **

TITLE: Image and document management system for content-based retrieval

DATE-ISSUED: November 16, 1999

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Swanson; Mitchell D.              Minneapolis     MN

Tewfik; Ahmed H.                  Edina           MN

Hosur; Srinath                    Plano           TX

US-CL-CURRENT: 707/6; 707/3, 707/9



□ 16. Document ID: US 5822465 A
L4: Entry 16 of 16 File: USPT Oct 13, 1998

US-PAT-NO: 5822465

DOCUMENT-IDENTIFIER: US 5822465 A

TITLE: Image encoding by vector quantization of regions of an image and codebook
updates

DATE-ISSUED: October 13, 1998

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Normile; James Oliver             Sunnyvale       CA

Wang; Katherine Shu-Wei           San Jose        CA

US-CL-CURRENT: 382/253; 382/239



Search Results - Record(s) 1 through 7 of 7 returned. 



□ 1. Document ID: US 6751354 B2
Using default format because multiple data bases are involved.

L6: Entry 1 of 7 File: USPT Jun 15, 2004

US-PAT-NO: 6751354

DOCUMENT-IDENTIFIER: US 6751354 B2

TITLE: Methods and apparatuses for video segmentation, classification, and
retrieval using image class statistical models

DATE-ISSUED: June 15, 2004

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Foote; Jonathan T.                Menlo Park      CA

Wilcox; Lynn                      Portola Valley  CA

Girgensohn; Andreas               Menlo Park      CA

US-CL-CURRENT: 382/224; 348/395.1, 348/405.1, 348/408.1, 382/190, 382/209, 382/218,
382/220



□ 2. Document ID: US 6678413 B1
L6: Entry 2 of 7 File: USPT Jan 13, 2004

US-PAT-NO: 6678413

DOCUMENT-IDENTIFIER: US 6678413 B1

TITLE: System and method for object identification and behavior characterization
using video analysis

DATE-ISSUED: January 13, 2004

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Liang; Yiqing                     Vienna          VA     22182

Crnic; Linda                      Denver          CO     80224

Kobla; Vikrant                    Ashburn         VA     20147

Wolf; Wayne                       Princeton       NJ     08540

US-CL-CURRENT: 382/181; 348/169



□ 3. Document ID: US 6574378 B1
L6: Entry 3 of 7 File: USPT Jun 3, 2003

US-PAT-NO: 6574378

DOCUMENT-IDENTIFIER: US 6574378 B1

TITLE: Method and apparatus for indexing and retrieving images using visual
keywords

DATE-ISSUED: June 3, 2003

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Lim; Joo Hwee                     Singapore                          SG

US-CL-CURRENT: 382/305; 358/403



□ 4. Document ID: US 6081750 A 
L6: Entry 4 of 7 File: USPT Jun 27, 2000 

US-PAT-NO: 6081750 

DOCUMENT-IDENTIFIER: US 6081750 A 

TITLE: Ergonomic man-machine interface incorporating adaptive pattern recognition 
based control system 

DATE-ISSUED: June 27, 2000 

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Hoffberg; Steven Mark             Yonkers         NY     10701-1705

Hoffberg-Borghesani; Linda Irene  Acton           MA     01720

US-CL-CURRENT: 700/17; 345/520, 700/11, 700/56, 700/83, 700/86



□ 5. Document ID: US 6072542 A 
L6: Entry 5 of 7 File: USPT Jun 6, 2000 

US-PAT-NO: 6072542

DOCUMENT-IDENTIFIER: US 6072542 A

TITLE: Automatic video segmentation using hidden markov model

DATE-ISSUED: June 6, 2000

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Wilcox; Lynn D.                   Portola Valley  CA

Boreczky; John S.                 San Leandro     CA

US-CL-CURRENT: 348/722; 348/700, 348/701, 382/155, 382/156, 382/157, 382/160,
386/68, 386/69, 386/95, 386/96



□ 6. Document ID: US 5875108 A
L6: Entry 6 of 7 File: USPT Feb 23, 1999

US-PAT-NO: 5875108

DOCUMENT-IDENTIFIER: US 5875108 A

TITLE: Ergonomic man-machine interface incorporating adaptive pattern recognition
based control system

DATE-ISSUED: February 23, 1999

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Hoffberg; Steven M.               Yonkers         NY     10701-1705

Hoffberg-Borghesani; Linda I.     Acton           MA     01720

US-CL-CURRENT: 700/17; 382/181, 382/190, 700/83



□ 7. Document ID: US 5822465 A
L6: Entry 7 of 7 File: USPT Oct 13, 1998

US-PAT-NO: 5822465

DOCUMENT-IDENTIFIER: US 5822465 A

TITLE: Image encoding by vector quantization of regions of an image and codebook
updates

DATE-ISSUED: October 13, 1998

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Normile; James Oliver             Sunnyvale       CA

Wang; Katherine Shu-Wei           San Jose        CA

US-CL-CURRENT: 382/253; 382/239



Documents


L5 and (multimedia or multi-media) and



Search Results - Record(s) 1 through 12 of 12 returned. 



□ 1. Document ID: US 6870962 B2
Using default format because multiple data bases are involved.

L8: Entry 1 of 12 File: USPT Mar 22, 2005

US-PAT-NO: 6870962

DOCUMENT-IDENTIFIER: US 6870962 B2

TITLE: Method and apparatus for efficiently encoding chromatic images using
non-orthogonal basis functions

DATE-ISSUED: March 22, 2005

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Lee; Te-Won                       San Diego       CA

Wachtler; Thomas                  Freiburg                           DE

Sejnowski; Terrence J.            Solana Beach    CA

US-CL-CURRENT: 382/248; 382/233, 382/240



□ 2. Document ID: US 6826524 B1
L8: Entry 2 of 12 File: USPT Nov 30, 2004

US-PAT-NO: 6826524

DOCUMENT-IDENTIFIER: US 6826524 B1

TITLE: Sample-adaptive product quantization

DATE-ISSUED: November 30, 2004

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Kim; Dong Sik                     Sungnam-Si                         KR

Shroff; Ness B.                   Lafayette       IN

US-CL-CURRENT: 704/200



□ 3. Document ID: US 6760714 B1
L8: Entry 3 of 12 File: USPT Jul 6, 2004

US-PAT-NO: 6760714

DOCUMENT-IDENTIFIER: US 6760714 B1

** See image for Certificate of Correction **

TITLE: Representation and retrieval of images using content vectors derived from
image information elements

DATE-ISSUED: July 6, 2004

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Caid; William R.                  San Diego       CA

Hecht-Neilsen; Robert             Del Mar         CA

US-CL-CURRENT: 706/14; 382/116, 382/228



□ 4. Document ID: US 6738512 B1
L8: Entry 4 of 12 File: USPT May 18, 2004

US-PAT-NO: 6738512

DOCUMENT-IDENTIFIER: US 6738512 B1

TITLE: Using shape suppression to identify areas of images that include particular
shapes

DATE-ISSUED: May 18, 2004

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Chen; Xiangrong                   Beijing                            CN

Zhang; Hong-Jiang                 Beijing                            CN

US-CL-CURRENT: 382/176; 382/200, 382/203



□ 5. Document ID: US 6571016 B1
L8: Entry 5 of 12 File: USPT May 27, 2003

US-PAT-NO: 6571016

DOCUMENT-IDENTIFIER: US 6571016 B1

** See image for Certificate of Correction **

TITLE: Intra compression of pixel blocks using predicted mean

DATE-ISSUED: May 27, 2003

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Mehrotra; Sanjeev                 Kirkland        WA

Wang; Albert S.                   Kirkland        WA

US-CL-CURRENT: 382/236; 375/240.13



□ 6. Document ID: US 6404923 B1

L8: Entry 6 of 12 File: USPT Jun 11, 2002

US-PAT-NO: 6404923

DOCUMENT-IDENTIFIER: US 6404923 B1

** See image for Certificate of Correction ** 

TITLE: Table-based low-level image classification and compression system 

DATE-ISSUED: June 11, 2002 

INVENTOR-INFORMATION: 

NAME CITY STATE ZIP CODE COUNTRY 

Chaddha; Navin Stanford CA 



US-CL-CURRENT: 382/224; 382/240, 382/253 



□ 7. Document ID: US 6397200 B1
L8: Entry 7 of 12 File: USPT May 28, 2002

US-PAT-NO: 6397200

DOCUMENT-IDENTIFIER: US 6397200 B1

TITLE: Data reduction system for improving classifier performance

DATE-ISSUED: May 28, 2002

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Lynch, Jr.; Robert S.             Groton          CT

Willett; Peter K.                 Coventry        CT

US-CL-CURRENT: 706/20; 702/181



□ 8. Document ID: US 6173275 B1
L8: Entry 8 of 12 File: USPT Jan 9, 2001

US-PAT-NO: 6173275

DOCUMENT-IDENTIFIER: US 6173275 B1

** See image for Certificate of Correction **

TITLE: Representation and retrieval of images using context vectors derived from
image information elements

DATE-ISSUED: January 9, 2001

INVENTOR-INFORMATION:

NAME                              CITY            STATE  ZIP CODE    COUNTRY

Caid; William R.                  San Diego       CA

Hecht-Neilsen; Robert             Del Mar         CA



Abstract 

Automatic assessment of voice transmission quality is
increasingly important to users and providers of communication
services and products. Human listener panels
currently provide subjective voice quality information.
However, this approach can be slow, expensive, and
unreliable. An objective method for assessing quality
based on statistical pattern recognition and expert system
techniques is described here.



1. Introduction 

Background 

Assessment of voice transmission quality is important
to users and providers of telecommunication services, as
well as to standards organizations. Unfortunately,
subjective assessment using human listeners can be
expensive, slow, and unreliable. Development of an
accurate and consistent objective method for voice
quality assessment represents an important and widespread
need.

Recent efforts to develop an objective measure of
voice quality are based on parameters measured directly
from the input and output speech [1-4]. While several
proposed objective parameters have shown good correlation
with subjective scores for some types of distortion,
further study is required to determine their
effectiveness over a wide variety of conditions and
impairments.

System Overview 

This paper proposes an Expert Pattern Recognition
(EPR) system for objective voice quality assessment.
The proposed system uses a combination of expert systems
techniques and statistical pattern recognition (SPR)
methods. It offers several potential advantages over
previous methods based on a single voice parameter:
(1) This is a multivariate technique that uses an
optimized set of objective parameters for improved
accuracy; (2) In addition to predicting Mean Opinion
Score (MOS), the system estimates opinion score
frequencies, i.e. the probability of each quality class:
P(excellent), P(good), P(fair), P(poor), P(unacceptable);
(3) Expert system techniques provide the
capability to logically integrate SPR results with other
knowledge bases to account for factors such as
transmission delay, which are not dealt with in the SPR
system.

The system first undergoes a training phase, which
selects rules and facts for the expert system and statistical
parameters for the SPR design. Training
requires a speech database containing a standard speech
source, processed speech representing impairments relevant
to the intended applications, and complete subjective
scores for the processed speech. Using the
training data, an optimum set of voice parameters is
selected from over 7.00 candidates using a bottom-up
evaluation technique described below. The parameter
set's probability density function is estimated for each
type of distortion using a Gaussian mixture model (i.e.
weighted sum of Gaussian functions). These densities
define the statistical parameters for the SPR system.
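
As a minimal sketch of the "weighted sum of Gaussian functions" mentioned above, the following evaluates a one-dimensional Gaussian mixture density. The weights, means, and variances are hypothetical illustration values, not figures from the paper, and the paper's actual parameter vectors are multivariate.

```python
import math


def gaussian_pdf(x, mean, variance):
    """Density of a 1-D Gaussian with the given mean and variance."""
    norm = math.sqrt(2.0 * math.pi * variance)
    return math.exp(-0.5 * (x - mean) ** 2 / variance) / norm


def mixture_pdf(x, weights, means, variances):
    """Weighted sum of Gaussian densities; weights are assumed to sum to 1."""
    return sum(
        w * gaussian_pdf(x, m, v)
        for w, m, v in zip(weights, means, variances)
    )


# Hypothetical two-component mixture for one voice parameter
# measured under one type of distortion.
weights = [0.6, 0.4]
means = [0.0, 3.0]
variances = [1.0, 0.5]
density_at_one = mixture_pdf(1.0, weights, means, variances)
```

Because each component integrates to one and the weights sum to one, the mixture is itself a valid density, which is what allows it to stand in for the class-conditional density of a distortion.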

In the second phase, a standard voice source is played
into the device or channel being tested, and an
assessment is made on the output signal. Objective
parameter values are measured and analyzed by the EPR
system, which uses training statistics to produce the
quality assessment. This process is illustrated in Fig.
1 and is described below.



2. Parameter Evaluation 

The system is capable of using multiple voice
parameters for its assessment. Many of the parameters
used in this research were adapted from published work
in voice quality, speaker identification, and speech
recognition. An evaluation algorithm is used to
identify a small subset of these parameters that will
provide the best possible performance.

Parameter Measurement

Parameters are measured on either the output
(impaired) speech only, or they measure distortion
between input and output speech. Some examples of measured
parameters are

• Output speech parameters: linear predictive
coding (LPC) related measures including partial
correlation (PARCOR) coefficients, log area
ratios, and cepstral coefficients; spectral
measures such as spectral flatness and average
power weighted frequency

• Distortion parameters: signal-to-noise ratio;
measures of spectral distortion; coherence-based
measures such as signal-to-distortion
ratio in particular frequency bands; cepstral
distance

Search Method

Maximum system performance is realized only with an
optimum set of objective parameters. Exhaustive evaluation
of all possible combinations of parameters yields
an optimal parameter set but is computationally prohibitive.
Thus, a more useful approach is a suboptimal,
"bottom-up" search that examines only selected combinations
[5]. The algorithm successively adds parameters
that most improve performance and removes parameters
that are no longer useful.
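
The add-and-remove search described above can be sketched as a greedy loop. This is an illustration under stated assumptions, not the algorithm of reference [5]: `error_of` is a hypothetical callable that scores a candidate parameter set (lower is better), the parameter name `SNR` is made up, and the toy error function simply counts mismatches against a made-up "ideal" set.

```python
def bottom_up_search(candidates, error_of):
    """Greedy bottom-up selection.

    Each iteration tries every single-parameter addition and removal,
    keeps the move with the lowest error, and stops as soon as no move
    lowers the error any further.
    """
    selected = frozenset()
    best_error = error_of(selected)
    history = []
    while True:
        moves = [selected | {c} for c in candidates if c not in selected]
        moves += [selected - {c} for c in selected]
        if not moves:
            break
        trial = min(moves, key=error_of)
        trial_error = error_of(trial)
        if trial_error >= best_error:
            break  # no single add or remove reduces the error
        selected, best_error = trial, trial_error
        history.append((sorted(selected), best_error))
    return selected, best_error, history


# Toy error function (assumption): distance from a made-up ideal set.
ideal = frozenset({"LPC-04", "SNR"})


def toy_error(parameter_set):
    return len(parameter_set ^ ideal)


chosen, final_error, steps = bottom_up_search(
    ["LPC-04", "AUTOCR05", "SNR"], toy_error
)
```

The error strictly decreases on every accepted move, so the loop always terminates, but like any greedy search it can stop at a parameter set that is only locally best.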

There are several metrics for evaluating performance.
Two useful metrics are chi-squared error and mean-squared
error of predicted MOS. Chi-squared error is
measured between the predicted opinion score frequencies
and the known listener panel opinion score frequencies.
It measures the system's ability to model the listener
panel and provides an overall measure of prediction
accuracy. In contrast, the mean-squared error metric
only measures how well the system predicts mean opinion
score and is therefore less useful than the chi-squared
error metric.
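
The difference between the two metrics can be illustrated with a small example. The paper does not spell out its chi-squared formula, so the Pearson-style form below is an assumption, and the frequency vectors are made-up values. The two distributions share the same mean opinion score, so a MOS-based error cannot tell them apart, while the chi-squared error can.

```python
def chi_squared_error(observed, predicted):
    """Pearson-style chi-squared distance between two frequency vectors.

    Assumes every predicted frequency is strictly positive.
    """
    return sum((o - p) ** 2 / p for o, p in zip(observed, predicted))


def mean_opinion_score(frequencies):
    """Expected score for classes 1 (unacceptable) through 5 (excellent)."""
    return sum(q * f for q, f in enumerate(frequencies, start=1))


# Hypothetical listener-panel frequencies and a flat prediction.
panel = [0.1, 0.2, 0.4, 0.2, 0.1]
flat_prediction = [0.2, 0.2, 0.2, 0.2, 0.2]

chi_sq = chi_squared_error(panel, flat_prediction)
mos_squared_error = (
    mean_opinion_score(panel) - mean_opinion_score(flat_prediction)
) ** 2
```

Here `mos_squared_error` is essentially zero even though the predicted histogram is clearly wrong, whereas `chi_sq` is not.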



3. Voice Quality Classification and Prediction 

Voice quality prediction is accomplished by analyzing 
training data from representative types of distortions. 
A statistical assessment is formed from the estimated 
probability density functions of the voice parameters. 
This assessment consists of estimated opinion score 
probabilities, mean opinion score, and most likely 
opinion score. 

Distortion Probability 

A vector of parameters measured from the i-th frame of
the test speech (unknown voice quality), X, is
represented by $x_i$. $p(x_i \mid d_m)$ is the conditional
probability density function of $x_i$ for the m-th distortion.
It is estimated using a k-nearest neighbor
method [6] by

$$p(x_i \mid d_m) = \frac{k-1}{N_{d_m}\, v(x_i)}. \tag{1}$$

Here, $N_{d_m}$ is the total number of training samples in the
m-th distortion, and $v(x_i)$ is the volume of a hypersphere
with radius equal to the distance to the k-th nearest
vector belonging to distortion $d_m$. A quick multimodal
estimate of $p(x_i \mid d_m)$ can also be formed by modeling the
density as a Gaussian mixture. Typically, (1) is used
during feature evaluation and selection, while the
Gaussian mixture is used for classifier design.

The probability of distortion $d_m$ is given by the Bayes
relationship

$$P(d_m \mid x_i) = \frac{p(x_i \mid d_m)\, P(d_m)}{\sum_j p(x_i \mid d_j)\, P(d_j)} \tag{2}$$

where $P(d_m)$ is the prior probability of the m-th distortion.

Opinion Score Probability

An estimate of the opinion score probability function,
$P(\omega_q \mid x_i)$, can now be obtained. This is the probability of
opinion score $\omega_q$, where the classes range from q=1
(unacceptable) to q=5 (excellent). This function is
interpreted as the predicted frequency of listener panel
scores corresponding to test speech, $x_i$. The
relationship is given by

$$P(\omega_q \mid x_i) = \sum_m P(d_m \mid x_i)\, P(\omega_q \mid x_i, d_m) \tag{3}$$

where $P(\omega_q \mid x_i, d_m)$ is the probability that opinion score $\omega_q$
is chosen given distortion $d_m$ and parameter vector $x_i$.
This function is approximated by the listener-panel
relative frequencies for distortion $d_m$, given by $P(\omega_q \mid d_m)$.

Classification and Mean Opinion Score Prediction

Having estimated class probability (3), we can now
obtain a minimum probability-of-error classification of
voice quality. This is given by the $\omega_q$ that gives the
largest value of $P(\omega_q \mid x_i)$. It represents the quality
level most likely to be selected by a listener panel
member. In a similar fashion, (2) can be used to choose
the most likely distortion.

The minimum mean-squared error prediction of Mean
Opinion Score is found as:

$$\mathrm{MOS}_i = \sum_q q \, P(\omega_q \mid x_i) \tag{4}$$

$$\mathrm{MOS} = \frac{1}{N} \sum_i \mathrm{MOS}_i \tag{5}$$

where N is the number of parameter vectors used in the
assessment.
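
The chain from equation (1) to equation (5) can be sketched end to end as follows. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, the panel frequencies stand in for $P(\omega_q \mid x_i, d_m)$ as described above, and `knn_density` assumes k is at least 2 and that the k-th neighbor lies at a nonzero distance.

```python
import math


def hypersphere_volume(radius, dim):
    """Volume of a dim-dimensional ball of the given radius."""
    return math.pi ** (dim / 2) / math.gamma(dim / 2 + 1) * radius ** dim


def knn_density(x, training_vectors, k):
    """Eq. (1): (k - 1) / (N * v(x)) for one distortion's training set."""
    distances = sorted(math.dist(x, t) for t in training_vectors)
    volume = hypersphere_volume(distances[k - 1], len(x))
    return (k - 1) / (len(training_vectors) * volume)


def distortion_posteriors(likelihoods, priors):
    """Eq. (2): Bayes rule over the candidate distortions."""
    joint = [lik * prior for lik, prior in zip(likelihoods, priors)]
    total = sum(joint)
    return [j / total for j in joint]


def opinion_probabilities(posteriors, panel_frequencies):
    """Eq. (3): posterior-weighted mix of per-distortion panel frequencies."""
    return [
        sum(post * freqs[q] for post, freqs in zip(posteriors, panel_frequencies))
        for q in range(5)
    ]


def frame_mos(opinion_probs):
    """Eq. (4): expected opinion score for one frame, classes 1..5."""
    return sum(q * p for q, p in enumerate(opinion_probs, start=1))


def overall_mos(frame_scores):
    """Eq. (5): average of the per-frame scores."""
    return sum(frame_scores) / len(frame_scores)


def most_likely_class(opinion_probs):
    """Minimum probability-of-error decision: the class with largest probability."""
    return max(range(5), key=lambda q: opinion_probs[q]) + 1
```

For example, with two distortions whose likelihoods at a frame are 0.2 and 0.6 and equal priors, `distortion_posteriors([0.2, 0.6], [0.5, 0.5])` weights the second distortion's panel histogram three times as heavily as the first when forming the predicted opinion score frequencies.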

Expert System 

An expert system is being developed to handle impairments
that are difficult to deal with using standard
objective techniques. Examples include infrequent pops
and clicks, delay and echo, time warping, and effects of
TASI (time assignment speech interpolation) circuits.
Other factors affecting voice quality are application
dependent and may not be measurable from the speech.
These include the user's nationality, hearing
impairments, and expectations about grade of service.
Rules and facts about human perception are being
identified to specifically deal with these issues. The
resultant knowledge base will embody information from a
wide variety of studies dealing with the effects of such
factors on voice quality. The expert system will use
this knowledge base, along with the SPR assessment of
each segment group, to create a final, overall quality
classification.



4. Example Results

Figs. 2 through 5 demonstrate example results
representing the major elements of the system. Fig. 2
shows results of applying the bottom-up evaluation
procedure to identify an effective parameter set.
Mean-squared error of predicted MOS is plotted against the
number of iterations. Each iteration either adds or
removes a parameter from the current parameter set in
order to minimize error. For example, in the first
iteration, the parameter LPC-04 is added and in the
fifth iteration, parameter AUTOCR05 is removed. The
algorithm terminates on the seventh iteration when it
can find no way to further reduce the mean-squared
error.

Fig. 3 shows decision boundaries resulting from
training the Bayes classifier. The speech parameter set
consists of LPC-01 (horizontal axis) and LPC-05
(vertical axis). Cluster analysis results are indicated
with unit standard deviation ellipses to represent
cluster locations. These are used to form a Gaussian
mixture estimate of the probability density for each
impairment. Three types of training data are used:
speech plus 0 dBRNC* ("A" ellipses), 25 dBRNC ("B"
ellipses), and 45 dBRNC ("C" ellipses) added noise. The
most likely opinion score was derived using (2) and (3)
and is depicted on the figure by different cross-hatching.
Note that "A" ellipses (no added noise) are
mostly located in regions of excellent (class 5) or good
(class 4) quality. Similarly, "B" (25 dBRNC added
noise) and "C" (45 dBRNC added noise) ellipses are
located in regions of lower quality. As Fig. 3 shows,
the decision boundaries are also extrapolated into
areas containing no training data.



* dBRNC refers to decibels above reference noise,
C-message weighted.

Equations (1) through (5) are based on the assumption
that parameter measurements are statistically
stationary. Unfortunately, normal speech is only
locally stationary, and this reduces system performance.
The effect of nonstationarity is lessened by segmenting
the speech prior to training. Each segment type is
composed of statistically similar frames of speech.
These are chosen using cluster analysis techniques.
Fig. 4 shows an example of segmentation into five types,
where five was chosen arbitrarily. Vertical lines
relate the speech signal, on top, to the corresponding
segment types, below. The EPR system is applied
separately to each segment type, yielding multiple
assessments that must be combined into a single quality
prediction. This can be done using regression or expert
system techniques. As expected, preliminary results
indicate that segmentation improves assessment accuracy.
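
A rough sketch of segmenting frames into types and then merging per-segment assessments is given below. Both choices are assumptions made for illustration: the paper says only that "cluster analysis techniques" are used, so plain k-means stands in for them here, and the frame-count-weighted average stands in for the regression or expert system combination. The frame values and segment scores are made up.

```python
import math


def kmeans(frames, k, max_iterations=50):
    """Plain k-means with a deterministic start (the first k frames)."""
    centroids = [list(frame) for frame in frames[:k]]
    labels = None
    for _ in range(max_iterations):
        new_labels = [
            min(range(k), key=lambda c: math.dist(frame, centroids[c]))
            for frame in frames
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centroids are too
        labels = new_labels
        for c in range(k):
            members = [f for f, lab in zip(frames, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def combine_segment_mos(labels, segment_mos):
    """Average the per-segment MOS values, weighted by frame count."""
    return sum(segment_mos[label] for label in labels) / len(labels)


# Hypothetical 2-D parameter vectors for five frames of speech.
frames = [[0.0, 0.1], [5.0, 5.1], [0.2, 0.0], [5.2, 4.9], [0.1, 0.2]]
segment_labels, segment_centroids = kmeans(frames, k=2)

# Hypothetical per-segment-type assessments from the SPR stage.
combined = combine_segment_mos(segment_labels, {0: 4.0, 1: 2.0})
```

Each segment type then gets its own density estimates and its own assessment, which keeps the stationarity assumption behind equations (1) through (5) closer to true within a type.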

Fig. 5 shows the system's assessment of speech plus
added noise. For this example, only four impairments
were used to train the system. The predicted opinion
score histogram is shown on top. A histogram of the
listener panel opinion scores immediately follows. Also
shown are predicted and actual MOS values, most likely
opinion score, and distortion probabilities for the
impairments used to train the classifier.



5. Conclusions 

This paper has described a new technique for processing
objective parameters to obtain voice quality
assessment. The method combines statistical pattern
recognition techniques with a knowledge-based expert
system. The SPR module uses optimally selected voice
parameters to form a purely statistical assessment of
voice quality. Output includes a classification of the
most likely opinion score, a prediction of mean opinion
score, and the predicted opinion score frequencies. An
expert system is being developed to combine these
results with additional knowledge bases to account for
factors such as delay, which are not accounted for by
the SPR system.



Fig. 1. System Overview. 




Fig. 2. MOS prediction error for parameter sets selected
during a single bottom-up evaluation.



Fig. 3. Decision regions for most likely opinion score. Ellipses
represent approximate training data locations for 0 dBRNC ("A"), 25
dBRNC ("B"), and 45 dBRNC ("C") added noise.




[Fig. 4 plot; horizontal axis: Sample Number]

Fig. 4. Example of speech signal (above) and corresponding
segment types (below).



Fig. 5. Example system output showing predicted and
listener panel quality assessments: opinion score
frequency, MOS, most likely opinion score (class),
and distortion probabilities for four training impairments.